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(54) Title: ACID-LABILE ISOTOPE-CODED EXTRACTANT (ALICE) AND ITS USE IN QUANTTTATrVE MASS SPECTRO- 
METRIC ANALYSIS OF PROTEIN MIXTURES 

(57) Abstract: The method of the invention provides novel compounds, termed aad=labile isotope-coded extractants (ALICE), for 
quantitative mass spectrometric analysis of protein mixtures. The compounds contaiivajhiol -reactive group that is used to capture 
t*^ cysteine-containing peptides from all peptide mixtures, an acid-labile linker, and a non-biological polymer. One of the two acid-labile 
linkers is isotopically labeled and therefore enables the direct quantitation of peptides/proteins through mass spectrometric analysis. 
Because no functional proteins are required to capture peptides, a higher percentage of oi^nic solvent can be used to solubilize the 
peptides, particularly hydrophobic peptides, through the binding, washing and eluting steps, thus permitting much better recovery 
of peptides. Moreover, since the peptides are covalentiy linked to the non-biological polymer (ALICE), more stringent washing 
^2 is allowed in order to completely remove non-specifically bound species. Finally, peptides captured by ALICE are readily eluted 
from the polymer support under mild acid condition with high yield and permit the direct down stream mass spectrometric analysis 
^5 without any further sample manipulation. In combination with our novel dual column two dimensional liquid chromatography- 
^ mass spectrometry (2D-LC-MS/MS) design, the ALICE procedure proves to a general approach for quantitative mass spectrometric 
analysis of protein mixtures with better dynamic range and sensitivity. 
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ACID-LABILE ISOTOPE-CODED EXTRACTANT (ALICE) AND ITS USB IN 
QUANTITATIVE MASS SPECTROMETRIC ANALYSIS OF PROTEIN 

MDCfURES 



BACKGROUND OF THE INVENTION 
The present invention relates to the field of high-throughput quantitative 
protein analysis and, more specifically, to novel reagents for use in such analysis. 
Most approaches to quantitative protein analysis are accomplished by 
10 combining protein separation, most commonly by high-resolution two-dimensional 
polyaciylamide gel electrophoresis (2D-PAGE). with mass spectrometry (MS)-based 
sequence or tandem mass spectrometry (MS/MS)-based sequence identification of 
selected, separated protein species. 

S. P. Gygi, et al.. Nature Biotech, 17:994-999 (October 1999) describes an 
15 approach to quantitative protem analysis based on a class of reagents termed isotope- 
coded affinity tags (ICAT), which consist of three functional elements: a specific 
chemical reactivity, an isotopicaUy coded Unker, and an affinity tag. The reagents 
described by Gygi utilize biotin as the affinity tag and rely upon biotin-avidin affinity 
binding to isolate the cysteine-containing peptides from the complex peptide mixture. 
20 Although the ICAT approach has many advantages over the traditional 2D- 

PAGE/MS approaches, it does possess some intrinsic limitations. For example, ICAT 
adds a relatively large chemical moiety onto the cysteine-containmg peptides and this 
functionaUty is very labile under collision induced dissociation (CID) condition and 
tinis complicates the downstream data analysis. Non-specific binding is also a concern 
25 since the enrichment reUe^ on non-covalent affinity binding between a protein (avidin) 
and tiie biotinylated peptides. Fmally, the captured peptides are not readUy eluted ftom 
the avidin beads with high recovery using MS-compatible conditions. Thus, tiiere is a 
need in the art for additional reagents and methods for improving performance in • 
quantitetive mass spectrometric analysis of protein mixtures. 

30 
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SUMMARY OF THE INVENTION 
The invention provides polymer-based compomds us^ 
analysis of miKtures containing proteins. Advantageously, the compounds of the 
invention bind covalently with the peptides vs^hich they are used to tag, permitting the 
5 tagged peptides to be subjected to more rigorous washing techniques. Thus, the 
tagged peptides are more readily purified, without nonspecifically boimd species. 
This results in lower background on MS spectra and thus provides an increase of 
dynamic range and sensitivity in quantitation and identification of the proteins. 

In one aspect, the invention provides a method for the. quantitative analysis of 

10 mixtures containing proteins. The method involves (a) reducing the disulfide bonds in 
the proteins of a sample to provide free thiol groups in cysteine-containing proteins; 
(b) blocking free thiols on the reduced proteins with a blocking reagent; (c) digesting 
the proteins in the sample using an enzyme such as trypsin; (d) reducing the peptides 
following the digestion step; (e) reacting cysteine-containing peptides with a reagent, 

15 wherein the reagent comprises a thiol-specific reactive group covalently boimd to a 
polymer tag via a linker, wherein the linker can be differentially labeled with stable 
isotopes (optionally prior to or following any of the reduction steps); (f) washing the 
polymer-bound peptides to remove non-covalently bound compoimds; (g) eluting the 
cysteine-containing peptides; and (h) subjecting the retrieved peptides to quantitative 

20 mass spectrometry (MS) analysis. In one embodiment, the method further involves 
performing steps (a) to (d) on a second sample; reacting cysteine-containing peptides 
in the second sample with a stable isotope-labeled form of the reagent, wherein in 
reacting step (e), the reagent used is a non-isotope labeled forrii of the reagent; mixing 
the peptides of the reacted sample following step (e) and the reacted second sample; 

25 and performing steps (g) and (h) on the peptides in the mixture. 

In another aspect, the invention provides a compound useful for capturing 
cysteine-containing peptides. This compound is composed of a thiol-specific reactive 
group attached to a non-biological polymer via a linker. In one desirable 
embodiment, the reagent has the formula: Al - Linker - A2 - polymer, wherein Al is a 

30 thiol-reactive group and A2 is an acid labile group to which the polymer is attached. 
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In yet anoflier aspect, flie invention provides a reagent kit for the mass spectral 
aiiaiysis of proteins that cbrnprises a cpmpoiind of the invent 

Other aspects tod advantages of the present invention are described 
the followmg detailed description of the preferred embodiments thereof. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 A and Fig. IB provides a schematic of the automated 2D-LC/MS System 
of the invention. 

10 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a novel approach for the quantitative analysis 
of proteins using acid-labile isotope coded extractants (ALICE) which are useful for 
capturing cysteine-containing peptides. The advantage of this approach over the prior 
art, is that it replaces biotin-avidin affinity binding vsdth acid-labile covalent binding to 

15 retrieve cysteine-containing peptides from the mixture. Since the binding is covalent, 
more stringent detergents or organic solvents can be used during the procedure to keep 
hydrophobic proteins and peptides m the solution and thus maximize the overall 
peptide recovery. Furthermore, the compounds and method of the invention avoid 
nonspecific peptide-protein binding. Removal of all detectable non-covalently bound 

20 species during the washing step(s) is also accomplished. Thus, the final cysteine- 
containing peptide solution is much less contaminated, resulting in higher sensitivity 
and dynamic range of MS analysis. Lastly, since the ALICE label is small in size and 
does not undergo firagmentation during MS/MS 2^ 
downstream MS analysis and database searching. 

25 In one embodiment, the present invention provides a compoimd of the 

formula: Al - Linker - A2 - polymer, wherein Al is a fliiol-reactive group and A2 is 
an acid labile group to which the polymer is attached. Alternatively the acid labile 
group may be absent and the polymer may be attached directly to the linker. 

Most preferably, the polymer is a non-biological polymer. As used herein a 
30 non-biological polymer includes inorganic polymers and organic polymers which 
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form a covalent bond with the acid-labile group, where present, or the linker. 
Suitably, an organic polymer selected does hot interfere w 

method of the invention, e.g., is stable under basic conditions and in the presence of 
the detergents and/or organic solvents required to maintain the mixture in solution. Li 

5 one suitable embodiment, the polymer used in the invention is a solid substrate 

composed of a homopolymer or a heteropolymer containing polystyrene, polyethylene, 
polyacrylamide, polyaciylein, polyethylene glycol, or the like. Suitable polymers and 
solid substrates, e.g., resins, beads or the like, are available from a variety of 
commercial sources including Sigma-Aldrich, NovaBiochem, and Beckman-Coulter, 

10 or may be synthesized using known techniques. An example of one suitable synthesis 
technique is provided in Example 1 below. However, the invention is not so limited. 

In one embodiment, the polymer is covalently bound to the linker via an acid- 
labile group that provides the compound of the invention with the ability to be readily 
eluted using an acidic reagent. In one preferred embodiment, the acid-labile group 

15 bound to the polymer has the following structure: 

Opolymer 




25 

in which the linker is -CONH-, -COO-, or another amide or ester. However, other 
structures can be readily synthesized to contain other suitable groups that provide 
similar qualities to the compoimd in terms of stability and accessibility to acid elution. 
30 Examples of suitable acid-labile groups include: 
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10 




Polymer 



Siber Linker: 

Linker 




Wang Linker: 



Linkei — O — H2C — i \— O— CH3 Polymer 
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In certain embodiments, this function may be provided by the linker, and the acid 
labile group may be absent 

The linker is any structure that may be differentially labeled with stable 
isotopes for xise in MS techniques. In one embodiment, the linker contain from 1 to 

5 100 atoms in length, about 3 to about 50 atoms in length, or about 5 to about 15 atoms 
in length, which are composed of carbon, and optionally, one or two atoms selected 
from O, S, NH, NR, NR\ CO, C(0)0, C(0)S, S-S, SO2, C(0)-NR', CS-NR% or Si-O. 
Optionally, one or more of the C atoms may be substituted with a small alkyl (CrCe), 
alkenyl, alkoxy, aryl, or diaryl groups. For example, the linker may be an alkyl, 

10 alkenyl, or alkynyl group, optionally substituted as described above. In another 

example, the linker may itself contain one or more O, S, NH, NR, NR', CO, C(0)0, 
C(0)S, S-S, SO2, C(0>NR', CS-NR', Si-O groups bound to one or more C atoms, 
which may be optionally substituted. 

In one embodiment, the linker is a structure (e.g., an alkyl group) which 

15 contains a substitution of about four to about twelve atoms with a stable isotope. 

However, in certain embodiments, it is desirable for the linker to contain substitutions 
of at least six atoms with a stable isotope. For example, for peptides at the higher end . 
of the molecular weight range at which MS is useful (e.g., about 2000 Da to 3500 Da) 
it may be desirable for the linker to contain eight, ten, twelve or more substitutions, in 

20 order to achieve the differential analysis required; whereas peptides at the lower end 
of the molecular weight range for MS (e.g., about 500 to 2000 Da) may require only 
four to six substitutions. For the selected number of substitutions, any one or more of 
the hydrogen, nitrogen, oxygen, carbon, or sulftu: atoms in th^ ; 
with ttieir isotopically stable isotopes: ^H, "C, ^''O, '*0, or ^S. 

25 Thus, the linker group has a structure tbat accommodates the number of 

isotope substitutions desired. The selection of this structure is not a limitation of the 
present invention. One or more of the atoms in the linker can be substituted with a 
stable isotope to generate one or more substantially chemically identical, but 
isotopically distinguishable compoxmds. Additionally or alternatively, the linker also 

30 optionally provides desired acid labile properties to the compound. 
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The compound of the invention further contains a functional group that is 
reactive, preferably specificaUy, with cysteine residues. Desirably, the reactive group 
is selected from the groiq) consisting of either maleimide (see below) 



10 



15 





N — and 
O o 

or a-haloacetyl groups such as X-CH2CO-. Most suitably, the X is selected from 
halogens such as iodine, bromine, and chorine to form iodoacetyl, bromoacetyl, or 
chloroacetyl functionalities. 

In another alternative, the thiol-reactive group may be selected from other a-, 
P-conjugated double bond structures, such as 




and 




and the like. Still other reactive group can readily be synthesized to contain other 
thiol-specific reactive groups for use m binding cysteine-containing peptides. 
20 In one preferred embodiment, a compound of the invention has the formula: 




p-polymcr 



Light ALICE 



In one desirable embodiment, this compound is isotopically modified as foUows. 
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However, the invention is not so limited. One of skill in the art can readily provide 
5 light ALICE with other stable isotopes. Further, one of skill in the art can readily 
produce other siiitable compounds in view of the guidance provided herein. 

METHOD OF USING THE COMPOUNDS OF THE INVENTION 

The compoxmds of the invention are particularly useful in mass spectrometric 

10 methods for quantitation and identification of one or more proteins in a mixture. The 
peptides analyzed by the method of the invention are most preferably about 500 
Daltons (Da) to about 3500 Da in size, but may be larger. Suitably, these peptides are 
formed upon enzymatic digestion of proteins in a complex mixture. The protein 
mixture may be a sample firom a cell or tissue culture, or biological fluids, cells or 

15 tissues. Samples from a culture include cell homogenates and cell fractions. 

Biological fluids include xirine, blood (including, e.g., whole blood, plasma and sera), 
cerebrospinal fluid, tears, feces, saliva, and lavage fluids. The mixtures may include 
proteins, lipids, carbohydrates, and nucleic acids. The methods of the invention 
employ MS and (MS)° methods. Currently, matrix assisted laser desoiption ionization 

20 MS (MALDI/MS) and electrospray ionization MS (ESI/MS) methods are preferred. 
However, a variety of other MS and ^S)" techniques may be selected. 
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In one embodiment, the invention provides a method for quantitative analysis 
ofa proteome using the compound of the inyentiori- Typically, a sample is obtained 
from a source, as defined above/ The sample may be compared to a reference protein 
mixture, which is obtained as a sample from the same source or may be obtained from 
5 another source. Where a sample protein mixture is to be compared to a second sample 
or a reference protem mixture, these mixtures are processed separately, applymg 
identical reaction conditions, with the exception fliat one sample will be reacted with 
the compound containing heavy stable isotopes. Where samples are not to be 
compared, separate processing to the pomt of reaction with the compound(s) of the 
10 invention is not necessary, but is pennitted. 

Typically, the protein sample is solubilized in a suitable buffer that may 
contain an organic solvent. Throughout the entire procedure except the final peptide 
elution step, the pH of the mixture is maintained under basic conditions. Most 
suitably, the pH is maintamed between 6.5 and 9, more preferably about 7.5 to 8.5, 
15 and most preferably about 7.2 to 7.5. 

The disulfide bonds of the proteins in the sample(s) or reference mbctures are 
reduced to free SH groups. Optionally, this step may be combined with solubilization 
of the protein or protein mixture, refenred to above. Suitable reducing agents include 
tri-n-butylphosphme (TBP), 2-mercaptoethanol, dithiothreitol, and tris-(p- 
20 carboxyethyl) phosphme. However, other suitable reducing agents may be 

substituted. In one embodunent, disulfide bonds m 2 mg of a protem are denatured 
using 8M urea, 200 mM ammonium bicarbonate, 20 mM CaCh, 5 pmole TBP, which 
has been pre-dissolved in 20iiL of acetonitrile (ACN) and incubated for one hoyr at 
about 37°C. In another embodiment, a protein may be incubated in 50 mM Tris 
25 buffer, 6 M guanidine-HCl, 5 mM TBP at pH 8.5 for 1 hour at 37^C. However, other 
concentrations of these components and/or other reducmg agents, buffered to a pH m 
the basic range may be selected and mcubated for varying lengths of tunes. 

Free thiols (SH) are blocked usmg a suitable blocking reagent, e.g., methyl 
methane thiosulfonate (MMTS), which functions under the basic conditions provided 
30 and does not interfere with the perfomiance of the followmg steps. Although MMTS 

9 
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is preferred, other suitable blocking reagents, including, without limitation, o- 
methylisoureai may be selected by one of skiU in the art^^ 

TTie proteiiis in the samples are en:^maticiaUy digeste^^ A suitable protease for 
use in this method may be readily selected from among proteases that are compatible 

5 with the basic conditions and the procedxire. Under certain circumstances, it may be 
necessary to dilute the sample mixture until any denaturing solubilizing agents in the 
sample are diluted to a point at which they are compatible with the activity of the 
protease or proteases used, hi one embodiment, the protease is trypsin. In another 
embodiment, the protease is the endoproteinase Lys-C (commercially available, e.g., 

10 from Promega, Roche Molecular Biochemical). In still another example, a mixture of 
proteases that have similar activity levels at basic pH is used. Such proteases may 
include aminopeptidases, carboxypeptidases, among others. Alternatively, the protein 
mixture is subjected to more than one digestion step. For example, the protein 
mixture may be subjected to digestion with Lys-C, followed by digestion with trypsin. . 

15 Multiple digestions are particularly desirable where the mixture is a complex mixture. 
One of skill in the art can readily determine whether a single digestion step, or 
multiple steps, are required. In yet another alternative, protein digestion may be 
omitted where the sample contains peptides, polypeptides or small proteins (e.g., 
about 500 to 5000 Da). 

20 Suitably, the peptides are again reduced prior to being reacted with the 

compounds of the invention to remove the blocking reagents. The reduction step is 
performed using the reagents described above. In one suitable embodiment, the 
niixture is reduced by incubation with 5 jmiole of re 

However, other suitable concentrations, regents, incubation temperatures and times 
25 may be readily substituted. 

A selected compound of the invention and a corresponding isotopically heavy 
compound are reacted with the samples. Typically, the reference sample is labeled 
with the isotopically heavy compound and the experimental sample(s) are labeled with 
the isotopically light form of the compound. However, the labeling may be reversed. 



10 



wo 02/48717 



PCT/USOl/50745 



Optionally, this labeling reaction may be performed at any stage of the method, e.g., 
prior to any of the reduction steps. 

After completion of the t^ging reaction, disfined aliquots of the samples 
labeled with isotopically different compounds (e.g., corresponding light and heavy 
5 compounds) are combined and all the subsequent steps are perfomaed on the pooled 
samples. Preferably, equal amounts of each sample are pooled. 

The pooled samples are washed in order to remove any non-covalentiy bound 
species. The use of the compounds of the invention permits the use of harsher 
washing steps than prior art reag«its can withstand. For example, one suitable 
10 method utilizes 5 X 1 mL of 50% acetonitrile (ACN), 5 X 1 mL of 30% ACN. 5 X 1 
mL of 90% ACN, 5 X ImL (non-diluted) ACN, and 10 X 5 mL dichloromethane. 
However, the concentration of ACN may be varied. Alternatively, other suitable 
solvents may be substituted. Examples of suitable solvents include organic solvents 
witii polarity properties similar to acetonitrile or dichloromethane. Yet another 
15 suitable method utilizes high concentrations of organic solvents, which effectively 
removes any residual detergents or surfactants. 

The tagged peptides are selectively retrieved by acid elution, which breaks the 
bond between the linker or acid labile group and the polymer to which it is covalentiy 
bond allowing the peptides tagged wifli the light or heavy compounds of the invention 
20 to be eluted. For example, tiie last washing may be eluted using 1 % to 5% 

trifluoroacetic add (TFA) in dichloromethane (CH2CI2). Using the method of the 
invention, peptide recovery is e,stimated at above 75%. Suitably, recovery may be 
even Wgher, eg, above 80%, 85%, and 90%^ 
utilized. 

25 The isolated, derivatized peptides retrieved are tiien analyzed using MS 

techniques. Both the quantity and sequence identity of the proteins from which the 
tagged peptides originated can be determined by autonoated multistage MS. This is 
achieved by tiie operation of the mass spectrometer ui a dual mode in which it 
alternates in successive (scans between measuring the relative quantities of p^tides 

30 eluting from tiie capillary column and recording the sequence information of selected 

11 
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peptides. Peptides are quantified by measuring in the MS mode the relative signal 
intensities for pairs of peptide ions of identical sequence fliat are tagged with the 
isotopically light or heavy forms of the compounds of tiie invention, respectively, and 
which therefore diSer in mass by the mass differential encoded within Ae affinity- 
5 tagged reagent. Peptide sequence information is automatically generated by selecting 
peptide ions of a particular mass-to-charge (m/z) ratio for collision-induced 
dissociation (CJD) in the mass spectrometer operating in the MS" mode. Using 
computer-searching algorithms, the resulting CBD spectra are then automatically 
correlated with sequence databases to identify the protein from which the sequenced 

10 peptide originated. A combination of the results generated by MS and MS" analyses 
of the differentially labeled peptide samples therefore determines the relative 
quantities as well as the sequence identities of the components of the protein mixtures 
in a single, automated operation. Alternatively, more accurate relative quantitation 
may be obtained by MS analysis of the isolated peptides with the mass spectrometer 

15 operating at MS mode only [see Automated LC/MS in Example 2: Instrumentation] 
Apparatuses for performing MALDI-MS and techniques for using such 
apparatuses are described in International Publication No, WO 93/24835, US Patent 
5,288,644, ILBeavis and B.Chait,Pwc.JVar/.^carf. ScL USA, 87:6873-6877 .(1990); 
B. Chait and K. Standing, InL J. Mass Spectrom, Ion Phys., 40:185 (1981) and 

20 Mamyrin et al, Sov. Phys, JETP, 37:45 (1973), all of which are incorporated by 

reference herein. Briefly, the frequency tripled output of, e.g., a Q-switched Lumonics 
HY400 neodynium/yttrium aluminum garnet lawer CNd-YAG") (355 nm, 10-nsec 
output pulse) is focused by a leiis (12-inch focal length) tihrpugh a fused silica window 
onto a sample inside the mass spectrometer. The product ions fomied by the laser are 

25 accelerated by a static electric potential of 30 kV. The ions then drift down a 2-m tube 
maintained at a vacuum of 30 pPa and their arrival at the end of the tube is detected 
and recorded using, e.g., a Lecroy TR8828D transient recorder. The transient records 
of up to 200 individual laser shots are summed together and the resulting histogram is 
plotted as a mass spectrum. Peak centroid determinations and data reduction can be 

30 performed using a VAX workstation or other computer system. However, other 
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apparatuses and techniques are known and may be readily utUized for analysis of the 
peptides of the inventioiL 

REAGENT KIT 

5 The invention further provides a reagent kit for the analysis of proteins by 

mass spectral analysis. Typically, such a kit wiU contain one or more compounds of 
ihe mvention. Most suitably, the kit awH contain a set of substantially identical, 
differentially labeled (isotbpically Ught and heavy) compounds. In one desirable 
embodiment, the kit will contain the compounds of the invention such that the 
10 polymer portion of the compound also serves as a soUd support, e.g., ahead or resin. 
The kit may further contain one or more proteolytic enzymes, blocking reagents, 
solubilizing detergent cocktaUs, or wash solutions. Other suitable components will 
be readily apparent to one of skill in the art 

The method and kit of the invention may be used for a variety of clinical and 
15 diagnostic assays, in which the presence, absence, deficiency br excess of a protein is 
associated with a normal or disease state. The method and kit of the invention can be 
used for qualitative and quantitative analysis of protein expression in cells and tissues. 
The method and kit can also be used to screen for proteins whose expression levels in 
ceUs or biological fluids are affected by a drug, toxin, environmental change, or by a 
20 change in condition or ceU state, e.g., disease state, maUgnancy, site-directed 
mutation, ^ne therapy, or gene knockouts. 

The fpllowiii^ examples are provided to illustrate the invention and do not 
limit the scope thereof. One skilled in the art wiU appreciate that although specific 
25 reagents and conditions are outliiied in the foUowing examples, modifications can be 
made which are meant to be encompassed by the spirit and scope of the invention. 
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EXAMPLES 
EXAMPLE 1 - SYimffiSIS OF THE C0^^ 

A. PreparationOf Linker And Affinity Tag 

A solution of maleic anhydride (0.98 g, 10.0 nunol in 1 S ml of acetic 
5 acid) was added to a solution of 6-aminocaproic acid (131 g, 10 mmol in 5 ml of 
acetic acid). The resulting mixture was stirred at room temperature for two hours. 
After two hours, the mixture was heated to reflux (oil bath temperature about 110- 
llO^C) for four and a half hours. The acetic acid was removed in vacuum and 3.3 g 
of a light yellow solid was obtained. This solid was chromatographed (20% ethyl 
10 acetate in hexanes, then 50% ethyl acetate in hexanes) and gave 0.92 g of pure target 
compound (6-(2,5-dioxo-2,5-'dihydro-pyrrol-l-yl)-hexanoic acid; 43% yield). This 
reaction is illustrated in the scheme provided below, in which acetic acid is 
abbreviated as HOAc. 
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B, Preparation Of Resiii 

The protected polymer, purchased conimercially as 
Seiber resin (1 g, 0.15 mmol/g) was stirred inN, N-dimethylformamide (DMF) (8 

5 mL) and then piperidine (2 mL) was added. The reaction mixture was stirred for ten 
minutes and then the solid was filtered and washed with methylene chloride and tiien 
dried under vacuum. This dry solid was then again stirred with piperidine (2 mL) in 
DMF (8 mL) for another ten minutes. The thin layer chromatography (TLC) was 
recorded and showed no trace of the fluorenyhnethyoxycarbonyl (Fmoc). The solid 

10 was then filtered and washed with methylene chloride, dried under low pressure to 
give about 1 g of the free amine polymer. This reaction is illustrated by the synthetic 
scheme below. 

15 
Polymer 

20 



Poller 

The polymer is a copolymer of polyetiiylene glycol and polystyrene. 

30 



NH-Fmoc 




15 
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C. Preparation Of Compound Of The Invention 

The deprqtected polymer (1 g, 0.15 mmol/g) synthesized as described 
in part B Was stirred in DMF (10 mL). To this mixture was added sequentially the 
compound which resulted from the reaction described in part A (0.095 g, 0.45 mmol), 
5 1-hydroxybenzotriazole (HOBT) (0.06 g, 0.45 mmol) and N, N- 

dicyclohexylcarbodiimide (DCC) (0.102 g, 0.5 mmol). The reaction mfacture was 
stirred for three hours and the solid filtered and washed successively with ethyl 
acetate, ether and methylene chloride. The solid was then dried in vacuum and gave 
about 1 g of the product illustrated below (ALICE of the invention). 
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EXAMPLE 2 - INSTRUMENTATION 

The present invention was carried out utilizing techniques and instrumentation 
known to those of skill in the art combined with a novel method of usiog the same. 
Specifically, data was obtained using automated LC/MS alone as well as using a novel 
5 automated 2-dimehsional LC/LC/MS system using instrumentation available in the 
art. These instruments and methods of using the same are described below. 
A. Automated LCVMS 

Automated LC/MS was accomplished using a LC/MS MicroMass Q- 
ToF^ mass spectrometer (Micromass, Manchester, UK) equipped with an ABI 140 C 
10 microgradient syringe pump system (Applied Biosystems, Framingham, MA). The 
sample was injected onto a strong cation exchange (SCX) colimm, a 100 ^im x 6 cm 
IntegraFrit column (New Objectives, Wobum, MA) packed with PolySULFOETHYL 
A, 12 pm, 300 A (PolyLC Inc., Columbia, MD). The sample was then eluted onto a 
RP-C18 column, a 75 fmi x 10 cm PicoFrit column (New Objectives, Wobum, MA) 
15 packed with YMC-Gel 10 pM CIS beads (YMC Inc., Wihnington, NC) using a 

solution of 500 mM KCl m 0.1 M acetic acid. The RP-C18 column was equiUbrated 
with 96% acetic acid/4% ACN and then the following gradient was run: (i) 4-65% 
RP-B over 75 minutes, (ii) 65-98% RP-B over the next 7 minutes, (iii) a hold at 98% 
RP-B for 5 minutes, and (iv) 98-1% RP-B over the next 3 mmutes at 250 jiL/min. 
20 Mobile-phase buffers were for RP-A: 0.1 M acetic acid, 1 % ACN and RP-B: 0.1 M 
acetic acid, 90% ACN. Data was acquired in the MS mode only. 
B. Automated 2D-LC/MS/MS 

Automated 2D-LC/MS/MS was accomplished using the system as 
shown' in Figs 1 A and IB. Specifically, a 2D LC-MS/MS Fmnigan LCQ Deca ion 
25 trap mass spectrometer was fitted with an Applied Biosystems 140C microgradient 
syringe pump system (Applied Biosystems, Framingham, MA), as the reverse phase 
pump (RP), and an Agilent 1 100 series bmary pump, as the strong cation exchange 
(SCX) and desalting pump. The pumps were attached to a VICI 10 port microbore 
two-position valve with a microelectric actuator (Valco Instruments CO Inc., Houston, 
30 TX). A strong cation exchange column, 50 x 1 mm PolySULFOETHYL A (PolyLC 
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Inc., Columbia, MD), was attached to port 9 and two 75 mm x 10 cm IntegraFiit 
columns (New Objectives, Woburn, MA) packed with YMC-Gel 10 )jm CIS beads 
(YMC Inc., Wilimngton, NC) were attached between ports 2 and 5, and 7 and 10, 
respectively. Another 75 pm x 3 cm CI 8 column packed in a PicoFrit colimm (New 
5 Objectives) was placed in between the titanium voltage union and the heated capillary 
of the mass spectrometer, to restore a loss of resolution from the valve and the 
titanium imion. 

Automation between the mass spectrometer, pumps and valve was 
accomplished using contact closures. First, the sample was loaded onto the SCX 

10 column using a Rheodyne injection valve (Rheodyne, Rohnert Park, CA) with the port 
valve at position 1 0 as shown in Fig. IB so that any unbound peptides would bind to 
the RP-18 column and elute in fraction 0. With this dual CI 8 colimm design, while 
one RP-C18 column (column A in Fig. 1 A) is being on-line with the mass 
spectrometer for peptide separation, the other CI 8 column (Column B in Fig. 1 A) is 

15 being regenerated, loaded with peptide sample eluted from the SCX column and 

desalted. After each HPLC gradient run is completed, the positions of the two RP-CIS 
columns were switched over using the two-position ten-port valve (Fig. IB) so that 
the time delay for equilibrating, sample loading from SCX and desalting was 
effectively eliminated. Peptide factions were eluted from the SCX column onto one 

20 RP-C18 column using the following salt steps: (i) 5%, (ii) 10%, (iii) 15%, (iv) 20%, 
(V) 30%, (vi) 40%, (vii) 50%, (viii) 65%, (ix) 85%, (x) 98%, (xi) 98%, (xii) 98%, and 
(xiii) 98%, SCX-B:SCX-A, for 10 minutes at 1 nL/min. Before each elution, 100% 
SCX-A was flowed at 1 fiL/ihiil for 20 minutes to equilibrate the RP CI 8 column and 
after each salt elution, 100% SCX-A was flowed at 1 ^L/min for 20 minutes for 

25 elutions (i) to (iv), 25 minutes for elutions (v) and (vi), 30 minutes for elutions (vii) 
and (viii), and 35 minutes for elutions (ix) to (xiii). The flow was then slowed down 
to 200 nL/min for the remainder of time to rinse the salt from the RP CI 8 column. 
Peptides were eluted from one CI 8 colxmm into the mass spectrometer using a linear 
RP gradient: a) 1-65% RP-B over 75 minutes, b) 65-98% RP-B over the next 7 

30 minutes, c) a hold at 98% RP-B for 5 minutes, and d) 98-1% RP-B over the next 3 
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minutes at 400 nL/min. Mobile-phase buffers were, RP-A: 0.1 M acetic acid, 1 % 
ACN; RPrB: 0.1 M acetic acid, 90% ACN; SCX-A: 0.1 M acetic acid, 1% ACN; 
SCX-B: 500 mM KCl. (Figs. 1 A and IB). 

5 EXAMPLE 3 - PREPARATION OF PROTEOMES FOR MS ANALYSIS 

2 mg of bovine serum albumin (BSA) were solubUized in 200 [iL of 8 M urea, 
200 mM ammonium bicarbonate, and 20 mM CaCk. 5 frniole of tributyi phosphine 
(TBP) pre-dissolved in 20 |iL of acetonitrUe (ACN) was added into the solubilized 
protein mixture and the resulting solution was incubated at STC for one hour. To the 
10 protein mixture was added 1 1 pmoles of MMTS and the mixture was vortexed for 10 
minutes. The protein solution was diluted 1 :1 witii 100 mM ammonium bicarbonate 
and 40 jig of Lys-C (2% w/w) were added. This mixture was then incubated at 37°C 
for 5 hours. The resulting solution was diluted 1:1 with water and then proteins were 
further digested with trypsin (2% w/w) at 37X for 15 hours. The resulting peptide 
15 solution was dried and then reconstituted with 50% acetomtrile/200mM sodium 
phosphate (pH 7.2). Disulfide bonds on tiie cysteine-containing peptides were 
reduced with TBP (5 }xmoles) at 37°C for one hour. Then 50 mg of the ALICE resin 
(about 1 1.5 nmole reactive sites) was added into the peptide solution and the solution 
vortexed for 1 hour at room temperature. The solutions were combined and loaded 
20 onto a column (glass type with teflon cockstop) and tiie resin was washed with the 
following solvent in sequence: 1) 5X 1 mL of 50% ACN, 2) 5X 1 mL of 30% ACN, 
3) 5 X 1 mL of 90% ACN, 4) 5 X 1 mL of pure ACN, 5) 10 X 5 mL of 
dichloromeifliane (DCM). 

Cysteine-containing peptides were then eluted from the resin witt 5% TEA in 
25 DCM uang continuous flow methodology. The resulting peptide solutioii was dried 
and reconstituted with 1% acetic acid in water. The reconstituted peptide solution was 
directiy subjected to automated 2D-LC/MS/MS analysis (as described above) without 
further treatment. MS analysis combined with database searching yielded both 
identities and quantities of the proteins. 
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Samples were taken from the mixture before and after acid elution for MS 
analysis to compare the overall recovery of cysteine-containiiig peptides with or 
wifhout using the ALICE approach. The results are provided below, with reference to 
the following published sequence of bovine serum albimiin (using single letter amino 
5 acid code): 



1 


MKVWTFISLL 


LLFSSATYSRG 


VFRRDTHKSE 


lAHRFKDLGE 


41 


EHFKGLVLIA 


FSQYLQQCPF 


DEHVKLVNEL 


TEFAKTCVAD 


81 


ESHAGCEKSL 


HTLFGDELCK 


VASLRETYGD 


MADGCEKQEP 


121 


ERNECFLSHK 


DDSPDLPKLK 


PDPNTLCDEF 


KADEKKFWGK 


161 


YLYEIARRHP 


YFYAPELLYY 


ANKYNGVFQE 


CCQAEDKGAC 


201 


LLPKIETMRE 


KVLTSSARQR 


LRCASIQKFG 


ERALKAWSVA 


241 


RLSQKFPKAE 


FVEVTKLVTD 


LTKVHKECCH 


GDLLECADDR 






WL/ 1 IOOI\l-l\C 






321 


IPENLPPLTA 


DFAEDKDVCK 


NYQEAKDAFL 


GSFLYEYSRR 


361 


HPEYAVSVLL 


RLAKEYEATL 


EECCAKDDPH 


ACYSTVFDKL 


401 


KHLVDEPQNL 


IDQNCDQFEK 


LGEYGFQNAL 


IVRYTRKVPQ 


441 


VSTPTLVEVS 


RSLGKVGTRC 


CTKPESERMP 


CTEDYLSLIL 


481 


NRLCVHEKT 


PVSEKVTKCC 


TESLVNRRPC 


FSALTDETY 


521 


VPKAFDEKLF 


TFHADICTLP 


DTEKQIKKQT 


ALVELLKHkP 


561 


KATEEQLKTV 


MENFVAFVDK 


GCAADDKEAC 


FAVEGPKLW 


601 


STQTALA 
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Peptides identified firom peptide mixtures before and after using ALICE for isolation 



of cysteine-containing peptides 



Peptides identified by LC-iy/IS/MS and 
database searcliing from the sample 
after enzymatic digestion but before 
reaction with ALICE (including both 
cysteine containing and non-cysteine 
containing peptides) 


Peptides identified by LC-MS/MS and 
database searching from the final 
sample eluted from the ALICE resin 
(exclusively cysteine-containing 
peptides) 


Position, based on SEQ ID N0.1 


Position, based on SEQ ID N0.1* 


508-523 


76-88 


508-523 


460-468 


402-412 


437-451 


89-100 


483-489 


106-117 


89-100 


267-280 


123-130 


198-204 


298-309 


106-117 


286-297 


310-318 


267-280 


581-587 


499-507 


161-167 


375-386 


45-65 


199-204 


123-130 


499-507 


310-318 


198-204 


286-297 


360-371 


76-88 


300-309 


460-468 


562-568 


588-597 


387-399 


421-433 


123-138 


52-65 


375-386 


529-544 


95-100 


139-151 


319-340 


300-309 


588-597 


413-420 


223-228 


413-420 


533-544 


529-544 


469-482 


598-607 


548-557 






35-44 


172-183 






45-65 


319-340 






347-359 


469-482 






341-353 


435-451 






354-359 


413-424 






168-180 


387-399 






361-371 


66-75 






581-597 


549-557 






569-680- 


139-151 







* Two highlighted cysteine-containing peptides: CASIQK (residues 223-228) and 
5 LCVLHEK (residues 483-489) were only detected firom the final sample eluted from 
the ALICE resin. 
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This study demonstrated that nonspecific binding associated with the use of 
conventional reagents is not a problem using the compounds of the inyentidn, since all 

the peptides eluted from the resin after washing are exclusively cysteine-containing 

■J 

peptides. This is because the compounds of the invention permit the use of much 
5 more stringent washing conditions, as compared to conventional ICAT reagents. 
Thus, the compounds of the invention provide lower "noise", better dynamic range 
and sensitivity in subsequent MS analysis. 

More specifically, in this study, 33 out of 35 cysteines were captured. Only 
one Cys-containing peptide, YNGVFQECCQAEDK (residues 1 84 - 197 of SEQ ID 

10 NO.l) was not recovered either before or after isolation. CASIQK (residues 223-228 
of SEQ ID NO.l) and LCVLHEK (residues 483-489 of SEQ ID NO.l) were only seen 
after isolation. This is likely due to the better dynamic range and sensitivity provided 
by the compound of the invention. Although not measured, overall recovery 
percentage is anticipated to be more than 75%. Steric hindrance in the capturing step 

15 is not a problem, since the peptides containing more than one cysteine were all 

uniformly modified by ALICE, the model compound of the invention. From all the 
CID experiments, no fragments observed were from the ALICE label, indicating that 
the compoimd would not interfere with the MS/MS experiments and subsequent 
protein identification by fragment-ion based database searching. 

20 

EXAMPLE 4 - CAPTURING C YSTEINE-CONTAINING PEPTIDES USING 
ALICE, SIMPLE PROTEIN MIXTURES, AND AUTOMATED LC/MS and 2D-' 

LC/MS • . : • 

Two mbctures were prepared, each containing eight proteins. This following 
25 table illustrates the composition of these mixtures. . 
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Composition of two protein mixtures 



Protein Name 


Protein JMDxtare A (nniol) 


Protein Mixture B (nmoi) 


Lysozyme 


10 


50 


a-Iactalbumin 


50 


10 


Ovalbumin 


25 


50 


Catalase 


50 


25 


P-lactoglobulin 


38 


50 


BSA 


50 


38 


Ribonuclease 


50 


50 


Trypsinogen 


50 


50 



Protein mixture A and protein mixture B (323 nmol of total protein were 
solubilized, respectively, in 325 |iL of 6 M urea, 5% 3-[(3-cholamidopropyl)- 

5 dimethylammonio]-l-propanesulfonate (CHAPS), and 50 niM Tris HCl. 1 1 ,3 ^imole 
of tributyl phosphine (TBP) pre-dissolved in 6.3 jiL of isopropanol (IP A) was added 
to each solubilized protein mixture and the resulting solutions were incubated at 37''C 
for one hour. To each protein mixture was added 200 nL of 50mM Tris-HCl (pH 8.0) 
and 34 |imol of metiianethiosulfonate (MMTS) predissolved in 3.5 |iL of IPA, and 
la themixtures were reacted for 30 mmutes. Each pr^^^ 

times with 50 niM Tris-HCl (pH 8.0) and digested with trypsin (5% w/w) at 37''C for 
16 hours. From the total peptide mixtures, 42% (21% from each mixture) was 
retained for future work, and the remaining 58% (1 87 nmol total protein) was dried 
and then reconstituted with 1.5 mL of 60% acetonitrile (ACN)/40% lOOmM Tris-HCl 

15 (pH 7.0). Disulfide bonds on the cysteme-containing peptides were reduced by TBP 
(18.7 funol) at 37'*C for one hour. Each solution was then vacuum concentrated for 10 
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minutes to remove excess TBP and ACN, and reconstituted to the previous volume 
using ACN. To each solution was added 55 ^mol of either Ught or heavy ALICE 
resins (3X TBP molar equivalent) and the solutions were stirred for 1 hour at room 
temperature. The reactions were quenched by the addition of P-mercaptoethanoI 

5 (BME) to a final concentration of 1 %. 

The protein mixtures were then combined and loaded onto a column (flitted 
glass type wilii Teflon cockstop) and the resin was washed with the following solvent 
in sequence: (i) 50 mL of at 50:50 ACN:water solution, (ii) 50 mL of pure AGN, (iii) 
50 mL of a 50:50 ACNidichloromethane (DCM) solution, and (iv) 50 mL of pure 

10 DCM- 

Cysteine-containing peptides were isolated by elution with 3x5 mL of 5% 
TFA in DCM using continuous flow methodology, 1 5 minute incubations with 
intermittent shaking, then 15 mL of continuous flow. The resulting peptide solution 
was dried and reconstituted with 2% ACN in 1% acetic acid/water. The reconstituted 

15 peptide solution was directly subjected to HPLC-MS MicroMass Q-ToF^ instrument 
(MicroMass, Manchester, UK) and 2D-LC-MS/MS (Fmnigan LCQ Deca, Fmnigan 
Corporation, San Jose, CA) analysis without further treatment These analyses, 
combined vsdth database searching, yielded both identities and quantities of the 
proteins. The chemical reactions for the isolation of cysteine-containing peptides are 

20 illustrated in the following scheme. 



24 



wo 02/48717 



PCT/USOl/50745 



o ^ HS^^Cys Peptides 
O Polymer 




OH 



1. Capture 



pH = 7.0-7.5 
O 




lO O Polymer 

2.Elution| 5%TFA 




O 



O-^PolymerV^ "2^ ^ ^ ^ IT S ^Cys; 



\ 



5 The results of the mass-spectrometric analysis are provided in the following table. In 
this table, M# = oxidized methionine residue; C* = light and heavy ALICE labeled 
cysteine residue. 
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Table - Sequence identification and quantitation of the components of a 
protein mixture using ALICE. 



Protein Name 


Peptide 

Charge 
State 


Peptide Sequence identified/ 
SEQ ID NO: 


Obs'd 

JKatlO/ 

Mean ±SD 


Exp. 
Katio 


% 


a-lactoalbumin 


432.20// 
2 


(K)C*EVFR(E) 
SEQIDNb:3 


4.97 


5 


0.6 


P-lactoglobulin/ 


1107.84/ 
/3 


(K)YLLFC*M#ENSAEPEQSLVC*QC 
*LVR(T):SEQIDNO:4 


0.76 
0.76 ± 0.01 


0.76 


0.3 




934.94// 
2 


(R)LSFNPTQLEEQC*HI(-) 
: SEQ ID NO:5 


0.77 






Catalase 


654.34// 
2 


(R)LC*ENIAGHLK(D) : 
SEQ ID NO: 6 


2.1 
0.02 ± 0.09 


2 


1 




436.56// 

3 


(R)LC*ENIAGHLK(D): 

SEQIDNO:6 


1.93 








979.00// 
2 


(R)LGPNYLQIPVNC*PYR(A) 
:SEQIDNO:7 


2.01 






Lyso^me 


1062.49/ 
/I 


(R)C*ELAAAM#K(R) 
:SEQIDNO:8 


0.2 


0.2 


02 


Ovalbunun 


739.80// 
2 


(A)SM#EFCFDVFK(E) 
:SEQIDNO:9 


0.61 
0.58 ± 0.05 


0.5 


16 




700.85// 
2 


(R)ADHPFLFC*IK(H): 
SEQ ID NO: 10 


0.6 








467.57// 
3 


(R)ADHPFLFC*IK(H) 
:SEQIDNO:10 


0.52 








838.44// 
2 


(R)YPILPEYLQC*VK(E) 
SEQIDNO:ll 


0.59 






Ribonuclease 


1189.08/ 

a 


(K)fflrVAC*EGNPYVPVHFDASVO 
SEQ ID NO: 12 


1.08 
1.00*0.11 


1 


0.4 




793.06// 
3 


(K)HnVAC*EGNPYVPVHFDASV(-) 
SEQ ID NO: 12 


1.16 
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Protein Name 


Peptide 
Mass// 
Charge 
State 


Peptide Sequence identifled/ 
SEQIDNO: 


Obs'd 
Ratio/ 

Mean ±SD 


Exp. 
Katio 


% 
Error 




595.04// 
4 


(K)HnVAC*EGNPYVPVHFDASV(-) 
SEQIDN0:12 


1.17 








706.60// 
4 


(R)C*KPVNTFVHESLADVQAVC*S 
QK(N) 
SEQIDN0:13 


0.89 








922.40// 
2 


(A)CEGNPYVPVHFDASV(-.) 
aa6-22ofSEQIDNO:12 


1.03 








608.63// 
3 


(F)VHESLADVQAVCSQK(N) 
aa 6-24 of SEQ ID NO: 12 


0.96 








865.5//1 


(K)HIIVAC*(E) 
aal-8ofSEQIDNO:14 


1.03 








433.25// 
2 


(K)HnVAC*(E) 
aa 1-8 of SEQ ID NO: 14 


0.9 








1239.5// 
1 


(Y)STM#SITDC*R(E) 
SEQ ID NO: 14 


0.9 








620.25// 
2 


(y)STM#SITDC*R(E) 
SEQ ID NO: 14 


0.84 






Trypsinogen 


580.3//2 


(A)PILSDSSC*K(S) 
aa 5-15 of SEQ ID NO: 15 


0.87 
1.02 ± 0.10 


1 


2 




1230.61/ 
/I 


(K)APILSDSSC*K(S) 
aa4-15ofSEQIDNO:15 


1.01 








615.80// 
z 


(K)APILSDSSC*K(S) 
aa 4-15 oi ojiv^ UJ rivi.i J 


1.18 








892.95// 
2 


(K)C*LKAPILSbSSC*K(S) 
SEQIDNO:15 


' 1.02 








595.63/A 
3 


(K)C*LKAPILSDSSC*K(S) 
SEQ ID NO: 15 


1.04 








958.41// 
2 


(K)DSC*QGDSGGPWC*SGK(L) 
SEQ ID NO: 16 


0.98 
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Table (cont'd) 



Pr teinName 


Peptide 
Mass// 

State 


Peptide Sequence identified/ 
SEQIDNO: 


Obs'd 
Ratio/ 


Exp- 
Ratio 


% 
Error 


BSA 


1141.6// 
1 


(C)C*TESLVNR(R) 
aa 497-506 of SEQ ID N0:1 


1.5 

1.35 ± 0:10 


1.32 


2.3 




566.25// 

2 


(C)C*TESLVNR(R) 
aa 497-506 of SEQ ID NO:l 


128 








623.35// 
2 


(H)TLFGDELC*K(V) 
aa 92-102 of SEQIDNO:! 


1.2! 








1194.02/ 
/2 


(K)C*C*AADDKEAC*FAVEGPK(L) 
aa 577-595 of SEQ ID NO:l 


1.24 








796.35// 
3 


(K)C*C*AADDKEAC*FAVEGPK(L) 
aa 577-595 of SEQ ID N0:1 


1.23 








2 


aa 496-506 of SEQ ID NO:I 


1.34 








3 


niC^r>DPHAC*YSTVFDKLKfH^ 
aa 386-402 of SEQ ID NO:l 


1.35 








2 


aa 584-595 of SEQ ID NO:l 


1.3 








3 


(K)EC*C*DKPLLEK(S) 
aa 300-3 1 1 of SEQ ID N0:1 


1.4! 








911.50// 
1 


(K)GAC*LLPK(I) 
aa 198-206 of SEQ ID NO:l 


1.48 








638.80// 
2 


(K)LFTFHADIC*(T) 

aa 525-535 of SEQ ID NO:l 


1.35 








638.80// 
2 


(K)IJTFHADIC*(T) 
aa 525-535 of SEQ ID NO:l 


1.51 








613.65// 
3 


(K)LKEC*C*DKPLLEK(S) 
aa 298-3 1 1 of SEQ ID NO: 1 


1.51 








3 


(K)LKPDPNTLC*DEFK(A) 
aa 139-153 ofSEQIDNO:I 


1.21 








786.89// 
2 


(K)SLHTLFGDELC*K(V) 
aa 89-102 of SEQIDNO:! 


L35 
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Table (cont'd) 



Protein Name 


Peptide 
Mass// 
Oharge 
State 


Peptide Sequence identified/ 
SEQIDNO: 


Obs'd 
Ratio/ 
Kfean ^SD 


Exp. 
Ratio 


% 

Error 




524.92// 
3 


(K)SLHTLFGDELC*K(V) 
aa 89-102 of SEQ ID NO:l 


1.35 








885.37// 
2 


(K)TC*VADESHAGC*EK(S) 
aa 76-90 ofSEQIDNOrl 


1.52 








590.58// 
3 


(K)TC*VADESHAGC*EK(S) 
aa 76-90 of SEQ ID NO: 1 


1.52 








591.62// 
3 


(K)VTKC*C*TESLVNR(R) 
aa 493-506 of SEQ ID NO: 1 


1.19 








798.86// 
2 


(K)YIC*DNQDTISSK(L) 
aa 286-299 of SEQ ID NO:l 


1.36 








1027.43 
7/2 


(K)YNGVFQEC*C*QAEDK(G) 
aa 1 84-1 99 of SEQ ID NO: 1 


1.2 








859.43// 
1 


(R)C*ASIQK(F) 
aa 223-230 of SEQ ID NO: 1 


1.46 








430.21// 
2 


(R)C*ASIQK(F) 
aa 223-230 of SEQ IDNO:l 


1.3 








1051.56 
//I 


(R)LC*VLHEK(T) 
aa 48 1 -488 of SEQ ID NO: 1 


1.35 








526.284 
//2 


(R)LC*VLHEK(T) 
aa 481-488 of SEQ ID NO:l 


1.27 








947.45// 
2 


(R)M#PC*TEDYLSLILNR(L) 
aa 468-482 of SEQ IDNO:l 


1.36 








631.97// 
3 


(R)M#PC*TEDYLSLILN^ 
aa 468-482 of SEQ ID NO: 1 


1.25 








1027.97 

111 


(R)NEC*FLSHKDDSPDLPKCL) 
aa 123-140 of SEQ ID NO:l 


1.27 








1017.50 

112 


(R)RPC*FSALTPDETYVPK(A) 
aa 505-52 1 of SEQ ID NO: 1 


L41 








61Z.612 
113 


(R)RPC*FSALTPDETYVPK(A) 
aa 505-521 of SEQIDNO:! 


1.39 
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This study demonstrated that quantification by ALICE is accurate after taking 
into account the following factors: isotopic imptirity of the heavy ALICE; different 
elation profile of the same peptides modified by heavy and light ALICE; non-specific 
en2ymatic cleavage. This improved quantitation accuracy by ALICE is even more 
5 evident when multiple cysteine-containing peptides are present. Peptides without any 
cysteine residue were rarely seen in the final captured peptide mixture since more 
stringent washing conditions completely removed non-specifically bound species. 
Furthermore, the use of large amounts of organic solvents also minimized the loss of 
peptides throughout the procedure. Finally, simplification of the peptide mixture by 

10 isolating cysteine-containing peptides in combination with the novel automated 2D- 
LC/MS design increase the overall sample loading capacity, the speed of sample 
analysis and the dynamic range and sensitivity of the MS analysis of protein inixtures. 
This experiment also further confirmed that reaction between ALICE and cysteine- 
containing peptides is efficient and stoichiometric and the effect of steric hindrance is 

15 not a concern since peptides with more than one cysteine residue were modified 
completely by ALICE. For example, a tryptic peptide with three cysteine residues 
derived from lyso2yme (NLC*NIPC*SALLSSDITASVNC*AK, SEQ ID NO:2) was 
uniformly labeled with either heavy or light ALICE (the mass difference (not shown) 
between this heavy and light mass pairs is exacfly 30 Da). Both light and heavy 

20 ALICE labeled peptides were effectively picked by the automated 2D-LC/LC/MS 
system for MS/MS analysis even though the peak intensity for the light ALICE 
labeled peptide is very low. Subsequent database searching identified the peptide as 
NLC*l^C*SALLSSpn*ASVNC*AK 
modified by light and heavy ALICE, respectively. 

25 

All publications cited in this specification are incorporated herein by reference 
hereiiL While the invention has been described with reference to a particularly 
preferred embodiment, it will be appreciated that modifications can be made without 
departing from the spirit of the invention. Such modifications are intended to fall 
30 within the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A method for the analysis of mixtures containing proteins, said method 
comprising the steps of: 

(a) reducing the disulfide bonds in the proteins of a sample, tiiereby 
providing thiol groups in cysteine-containing proteins; 

(b) blocking free thiols with a blocking reagent in the sample; 

(c) digesting the proteins in the sample to provide peptides; 

(d) reducing the disulfide bonds in the digested peptides, thereby 
providing thiol groups in cysteine-containing peptides for reaction; 

(e) reacting cysteine-containing peptides in the sample with a reagent, 
wherein said reagent comprises a thiol-specific reactive group which is attached to a 
polymer tag via a linker, wherein the linker can be differentially labeled with stable 
isotopes and wherein the polymer tag forms a covalent bond with the cysteine- 
containing peptides; 

(f) washing the polymer-bound peptides to remove non-covalently 

bound species; 

(g) eluting the cysteine-containing peptides; and 

(h) subjecting the eluted peptides to quantitative mass spectrometry 

(MS) analysis. 

2. The method according to claim 1 , wherein said method further 

coinprises the steps of : 

performing stepis (a) to (d) on a second sami>le; 

reacting cysteine-containing labels in the second sample with a stable 
isotope-labeled form of the reagent, wherein in reacting step (e), the reagent used is a 
non-isotope labeled forai the reagent; 

mixing the peptides of the reacted sample following step (e) and the 
reacted second sample; and 

performing steps (g) and (h) on the peptides in the mixture. 
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3. The method according to claim 1, wherein the reagent comprises a 
thiol-specific reactive group is selected from tiie group consisting of a-haloacetyl and 
maleimide. 

4. The method according to claim 1 , wherein the blocking reagent is 
methyl methane thiosulfonate. 

5. The method according to claim 1 , wherein the reagent has the formula: 

Al - Linker - A2 - polymer 
wherein A 1 is the thiol-reactive group and A2 is an acid labile group to which 
the polymer is bound. 

6. The method according to claim 5, herein the acid-labile group bound 
to the polymer has the structure: 

O-polymer 




7. The method according to claim 5, wherein the polymer in the reagent is 
a polymer resin. 
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8. The method according to claim 7, wherein the polymer resin is a 
homopolyiner or heteropolymer comprising a polymer selected from the group 
consisting of polystyrene and polyethylene glycol. 

9. The method according to claim 8, vdierein the linker contains a 
substitution of at least six hydrogen atoms with a stable isotope. 

10. The method according to claim 9, wherein the linker contains ten stable 
isotopes. 

1 1 . The method according to claim 9, wherein the stable isotope is 
deuterium. 

12. The method according to claim 1 > wherein the non-isotope labeled 
reagent is 

O-polymer 
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13. The method according to claim 1 , wherein the isotope labeled reagent 
has the formula: 




14. The method according to claim 1 , wherein the eluted peptides are 
subjected to high-performance liquid chromatography-mass spectrometry (MS) 
analysis, two-dimensional liquid chromatography MS, or MS/MS analysis. 

15. The method according to claim 1 , wherein the proteins are digested 
using trypsin. 

16. A compoimd useful for capturing cysteine-containing peptides, which 
is selected from the group consisting of a thiol-specific reactive group attached to a 
non-biological polymer via a linker. 

1 7. The compound according to claim 1 6, wherein the linker contains a 
substitution of aUeast six atoms \^th a stable isotope^^^ 

1 8. The compound according to claim 16, wherein the linker contains ten 
stable isotopes. 

19. The compound according to claim 17, wherein the stable isotope is 
deuteriimi. 
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20. The compound according to claim 1 6, selected from the group 
consisting of: 



O-polymer 



and 




21 . A reagent kit for the analysis of proteins by n^iass spectral analysis that 
comprises a compound of cLaim 16. 

22. The reagent kit of claim 21 which comprises a set of substantially 
identical differentially labeled cysteine-tagging res^ents. 

23. The reagent kit of claim 22 fiirttier comprising one or more proteolytic 
enzymes for use in digestion of proteins to be analyzed. 
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SEQUENCE LISTING 



<110> Genetics Instiliute, Inc^ 

.<i20> ACID-IABil.E ISOTpPE-CODED EXTRACTANT (ALICE) AND ITS uiSE IN 
QUANTITATIVE MASS SPECTROMBTRIC ANALYSIS OF PROTEIN MIXTURES 

<130> GI5412APCT 

<150> 60/242,643 
<151> 2000-10-23 

<160> 16 

<170> Patentln version 3*1 

<210> 1 

<211> 604 

<212> PRT 

<213> Bovine Serum Albumin 

<400> 1 

Met Lys Trp Val Thr Phe lie Ser Leu Leu Leu Leu Phe Ser Ser Ala 
1 5 10 15 

Thr Tyr Ser Arg Gly Val Phe Arg Arg Asp Thr His Lys Ser Glu lie 
20 25 30 

Ala His Arg Phe Lys Asp Leu Gly Glu Glu His Phe Lys Gly Leu Val 
35 40 45 

Leu He Ala Phe Ser Gin Tyr Leu Gin Gin Cys Pro Phe Asp Glu His 
50 55 60 

Val Lys Leu Val Asn Glu Leu Thr Glu Phe Ala Lys Thr Cys Val Ala 
65 70 75 80 

Asp Glu Ser His Ala Gly Cys Glu Lys Ser Leu His Thr Leu Phe Gly 
85 90 95 

Asp Glu Leu Cys Lys Val Ala Ser Leu Arg Glu Thr Tyr Gly Asp Met 
100 105 110 

Ala Asp Cys Cys Glu Lys Gin Glu Pro Glu Arg Asn Glu Cys Phe Leu 
115 120 125 

Ser His Lys Asp Asp Ser Pro Asp Leu Pro Lys Leu Lys Pro Asp Pro 
130 135 140 

Asn Thr Leu Cys Asp Glu Phe Lys Ala Asp Glu Lys Lys Phe Trp Gly 
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145 150 155 160 

Lys Tyr Leu Tyr Glu lie Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro 
165 170 175 

Glu Leu Leu Tyr Tyr Ala Ash Lys Tyr Asrx Gly Val Phe Gin Glu Cys 
180 185 190 

Cys Gin Ala Glu Asp Lys Gly Ala Cys Leu Leu Pro Lys lie Glu Thr 
195 2O0 205 

Met Arg Glu Lys Val Leu Thr Ser Ser Ala Arg Gin Arg Leu Arg Cys 
210 215 220 

Ala Ser lie Gin Lys Phe Gly Glu Arg Ala Leu Lys Ala Trp Ser Val 
225 230 235 240 

Ala Arg Leu Ser Gin Lys Phe Pro Lys Ala Glu Phe Val Glu Val Thr 
245 250 255 

Lys Leu Val Thr Asp Leu Thr Lys Val His Lys Glu Cys Cys His Gly 
260 265 270 

Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr lie 
275 280 285 

Cys Lys Asn Gin Asp Thr He Ser Ser Lys Leu Lys Glu Cys Cys Asp 
290 295 300 

Lys Pro Leu Leu Glu Lys Ser His Cys He Ala Glu Val Glu Lys Asp 
305 310 315 320 

Ala lie Pro Glu Asn Leu Pro Pro Leu Thr Ala Asp Phe Ala Glu Asp 
325 330 335 

Lys Val Cys Lys Asn Tyr Gin Glu Ala Lys Asp Ala Phe Leu Gly Ser 
340 345 350 

Phe Leu Tyr Glu Tyr Ser Arg Arg His Pro Glu Tyr Ala Val Ser Val 
355 360 365 

Leu Leu Arg Leu Ala Lys Glu Tyr Glu Ala Thr Leu Glu Glu Cys Cys 
370 : 375 380 

Ala Lys Asp Asp Pro His Ala Cys Tyr Ser Thr Val Phe Asp Lys Leu . 
385 390 395 400 

Lys His Leu Val Asp Glu Pro Gin Asn Leu He Asp Gin Asn Cys Asp 
405 410 415 

Gin Phe Glu Lys Leu Gly Glu Tyr Gly Phe Gin Asn Ala Leu He Val 
420 425 430 
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Arg Tyr Thr Arg Lys Val Pro Gin Val Ser Thr Pro Thr Leu Val Glu 
435 440 445 

Val Ser Arg Ser Leu Gly Lys Val Gly Thr Arg Cys Cys Thr Gly Pro 
450 455 460 

Glu Ser Glu Arg Met Pro Cys Thr Glu Asp Tyr Leu Ser lie Leu Asn 
465 470 475 480 

Arg Leu Cys Val His Glu Lys Thr Pro Val Ser Glu Lys Val Thr Lys 
485 490 495 

Cys Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu 
500 505 510 

Thr Asp Glu Thr Tyr Val Pro Lys Ala Phe Asp Glu Lys Leu Phe Thr 
515 520 525 

Phe His Ala Asp lie Cys Thr Leu Pro Asp Thr Glu Lys Gin He Lys 
530 535 540 

Lys Gin Thr Ala Leu Val Glu Leu Leu Lys His Lys Pro Lys Ala Thr 
545 550 555 560 

Glu Glu Gin Leu Lys Thr Val Met Glu Asn Phe Val Ala Phe Val Asp 
565 570 575 

Lys Cys Cys Ala Ala Asp Asp Lys Glu Ala Cys Phe Ala Val Glu Gly 
580 585 590 

Pro Lys Leu Val Val Ser Thr Gin Thr Ala Leu Ala 
595 600 



<210> 2 

<211> 23 

<212> PRT 

<213> Peptide from Lysozyme 

<400> 2 

Asn Leu Cys Asn He Pro Cys Ser Ala Leu , Leu Ser Ser Asp lie Thr 
1' ' . • . ■ 5 .- -^-O- ^ ■ ■• -is- ■ 

Ala Ser Val Asn Cys Ala Lys 
20 



<210> 3 
<211> 7 
<212> PRT 
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<213> Peptide from alpha-lactoalbumin 
<400> 3 

Lys. Cys Glu Val Phe Arg Glu 

i ' 5- •■ 



<210> 4 

<211> 25 

<212> PRT 

<213> Peptide from beta-lactoglobulin 

<400> 4 

Lys Tyr Leu Leu Phe Cys Met Glu Asn ser Ala Glu Pro Glu Gin Ser 
1 5 10 15 

Leu Val Cys Gin Cys Leu Val Arg Thr 
20 25 

<210> 5 
<211> 15 
<212> PRT 

<213> Peptide from beta-lactoglobulin 
<400> 5 

Arg Leu Ser Phe Asn Pro Thr Gin Leu Glu Glu Gin Cys His lie 
15 10 15 



<210> 6 

<211> 12 

<212> PRT 

<213> Peptide from Catalase 

<400> 6 

Arg Leu Cys Glu Asn He Ala Gly His Leu Lys Asp 
1 5 10 



<2io> : 7 • 

■<2il> 17 
<212> PRT 

<213> Protein from catalase 
<400> 7 

Arg Leu Gly Pro Asn Tyr Leu Gin He Pro Val Asn Cys Pro Tyr Arg 
15 10 15 
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Ala 



<2lO> 8 

<2li> 10 

<212> PRT 

<213> Protein from lysozyme 



<400> 8 

Arg Cys Glu Leu Ala Ala Ala Met Lys Arg 
15 10 



<210> 9 
<211> 12 
<212> PRT 

<213> Protein from ovalbumin 
<400> 9 

Ala Ser Met Glu Phe Cys Phe Asp Val Phe Lys Glu 
15 10 



<210> 10 

<211> 12 

<212> PRT 

<213> Peptide from ovalbumin 

<400> 10 

Arg Ala Asp His Pro Phe Leu Phe Cys lie Lys His 
1 5 10 



<210> 11 
<211> 14 
<212> PRT 

<213> Peptide from . ovalbumin 
<406> "11 

Arg Tyr Pro lie Leu Pro Glu Tyr Leu Gin Cys Val Lys Glu 
15 10 



<210> 12 
<211> 21 



Page 5 



wo 02/48717 



PCT/USOl/50745 



<212> PRT 

<213> Peptide from ribonuclease 



<400> 12 



Lys His lie 
1 



lie Val Ala Cys Glu Gly* Asn Pro Tyr Val Pro Val His 
5 10 15 



Phe Asp Ala Ser Val 
20 



<210> 13 

<211> 24 

<212> PRT 

<213> Peptide £rom ribonuclease 



<400> 13 

Arg Cys Lys Pro Val Asn Thr Phe Val His Glu Ser Leu Ala Asp Val 
15 10 15 

Gin Ala Val Cys Ser Gin Lys Asn 
20 



<210> 14 
<211> ' 11 
<212> PRT 

<213> Ppetide from ribonuclease 



<400> 14 

Tyr Ser Thr Met Ser lie Thr Asp Cys Arg Glu 
15 10 



<210> 15 

<211> 15 . 
<212> ".PRT * 

<213> Peptide from trypsinbgen 
<400> 15 

Lys Cys Leu Lys Ala Pro lie Leu Ser Asp Ser Ser Cys Lys Ser 



1 



5 



10 



15 
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<210> 16 

<211> 18 

<212> PRT 

<213> Peptidie from trypsinogen 



<400> 16 



Lys Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Val Val Cys Ser Gly 
^ 5 10 15 



Lys Leu 
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